Abstract

Translocation of DNA through a nanopore with embedded electrodes is at the centre of new rapid,
inexpensive sequencing methods which allow distinguishing the four nucleobases by their different
electronic structure. However, the subnanometer separation between nucleotides in DNA requires
ultra-sharp probes. Here, we propose a device architecture consisting of a nanopore formed in
bilayer graphene, with the two layers acting as separate electrical contacts. The 0.34 nm
interlayer distance of graphene is ideally suited for electrical coupling to a single nucleobase,
avoiding the difficulty of fabricating probes with subnanometer precision. The top and bottom
graphene electrodes contact the target molecule from the same lateral side, removing the
orders-of-magnitude tunneling current variations between smaller pyrimidine bases and larger
purine bases. We demonstrate that incorporating techniques for molecular manipulation enables the
proposed device to sequence single-stranded DNA and that it even offers the prospect of sequencing
double-stranded DNA.

Keywords: nanopore sequencing, bilayer graphene, DNA overstretch 

Electrical monitoring of DNA translocation through a nanopore has been proposed as a fast,
low-cost and high-throughput sequencing technique, 1-4 and the past decade has witnessed
tremendous progress towards that goal. 5-14 Recently, it has been suggested 15 that one could
create a nanogap in graphene and use the resulting edges as transverse electrodes for DNA
sequencing. This proposal holds promise for detecting each individual nucleobase on a DNA strand,
since graphene edges can be considered to exhibit the ultimate sharpness which a transverse
electrical probe could possess. Crucial experimental progress was independently established by
several research groups who successfully demonstrated DNA translocation through fabricated
graphene nanopores.

Although graphene nanogaps/nanopores may turn out to be the most feasible and efficient 
way for sequencing, several major challenges still remain to be addressed. One of the biggest 
difficulties is to define and align electrical contacts on a nanopore-embedded graphene sheet. If 
we resort to a conventional 2-dimensional (2-D) in-plane electrode structure, the gap between 
the probes would need to be very small, i.e., about 2 nm, to ensure that the nucleotides under 
interrogation can bridge the two electrical contacts. Otherwise, the tunneling current would drop 
below any measurable range, since it decreases exponentially with the distance between electrodes 
and molecule. However, such a precise shaping of in-plane graphene electrodes separated by a 
nanopore is expected to be technically extremely challenging. 

Here, we propose a novel architecture to address and potentially solve this issue, namely by
using bilayer graphene as the electrical reading elements inside a nanopore, as illustrated in
Figure 1(a) and (b): an array of nanopillars is mounted to stretch the DNA molecule before it
enters the nanopore, while bilayer graphene is embedded inside the nanopore as electrical
contacts. The uniqueness of the proposed device is that the top and bottom graphene layers serve
as two separate electrical contacts, and the two contacts probe the target molecule from the same
lateral side. Several advantages are expected from adopting this bilayer graphene lateral contacts
(in the following abbreviated as BGLC) design. First, the interlayer distance between the two
van-der-Waals-bound graphene sheets amounts naturally to 0.34 nm, which is less than (for
single-stranded DNA) or equal to (for double-stranded DNA) the spacing between adjacent
nucleotides, ensuring that no more than one nucleotide should be in contact with both electrodes
at a given time, thus readily achieving single-base resolution. This self-assembly of the bilayer
graphene into lateral contacts separated by 0.34 nm may be regarded as an important benefit
compared to previous proposals, which all require the fabrication of graphene gaps or ribbons with
subnanometer precision. These enormous difficulties in fabrication have so far prevented the
experimental realization of electrically detecting individual nucleobases through graphene
nanogaps or nanoribbons. We emphasize that those formidable demands on processing precision are
not required for our proposed design, which should thus allow a much more straightforward
implementation. Second, the substantial uncertainty of measurements caused by the variation in
size of the different nucleobases in the conventional 2-D transverse electrode structure is also
circumvented in our design. In the original proposal, the tunneling conductance through individual
nucleotides was thought to be determined by the densities of states (DOS) of the nucleobases, and
the associated distinction could be utilized as electrical signatures of different nucleotides. 8
But, as shown in Figure 1(c) and (d), the inherent volume difference of the four nucleobase types
would lead to significant fluctuations of the coupling strength between target molecule and
electrical contacts, resulting in orders-of-magnitude variations in the measured tunneling
conductance in the low-bias region. This
can be easily understood from the following formula, which estimates the tunneling conductance
through a molecule (here, a nucleobase) within the linear-response regime:

$$G = \frac{2e^2}{h}\,\frac{\Gamma_L \Gamma_R}{(E_f - E_0)^2} \qquad (1)$$

where $\Gamma_{L(R)}$ represents the coupling between the nucleobase and the left (right) probe,
$E_f$ is the Fermi energy of graphene, and $E_0$ is the energy of the highest occupied molecular
orbital (HOMO) or lowest unoccupied molecular orbital (LUMO) of the nucleobase. If $\Gamma_{L(R)}$
could be assumed to remain constant for different nucleotides, the different HOMO (LUMO) locations
of adenine, guanine, cytosine, and thymine (ADE, GUA, CYT, and THY) would indeed determine the
magnitude of their respective conductance. However, $\Gamma_{L(R)}$ is extremely sensitive to the
distance between molecule and electrodes. The size variation of different nucleobases unavoidably
causes changes in the molecule-electrode distance, thus causing large fluctuations in
$\Gamma_{L(R)}$ and consequently in the measured tunneling conductance. 6 In contrast, in our
design the top and bottom graphene electrodes contact the nucleotide to be scanned from the same
lateral side, as illustrated in Figure 1(e), rather than from two opposite sides, as shown in
Figure 1(c) and (d). Therefore, the size variation of the nucleobase under interrogation does not
affect the coupling, and the tunneling conductance is now solely governed by the DOS of the
nucleobase. We further demonstrate below that, by incorporating a molecule manipulation
technology, optimized nucleotide discrimination can be attained for over-stretched DNA threading
through the nanopore.
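
As a rough numerical illustration of the off-resonant single-level estimate discussed above, the following Python sketch evaluates G = (2e^2/h) Γ_L Γ_R / (E_f - E_0)^2 and shows how strongly the result reacts to the electrode-molecule distance. The exponential `coupling` model, its decay rate and all numerical inputs are assumptions made purely for illustration; they are not values taken from this work.

```python
import math

CONDUCTANCE_QUANTUM = 7.748091729e-5  # 2e^2/h in siemens


def linear_response_conductance(gamma_l, gamma_r, e_fermi, e_level):
    """Off-resonant single-level estimate G = (2e^2/h) * gamma_l * gamma_r / (E_f - E_0)^2.

    Couplings and energies must share one energy unit (eV here).
    """
    detuning = e_fermi - e_level
    if detuning == 0.0:
        raise ValueError("the off-resonant estimate diverges at E_f == E_0")
    return CONDUCTANCE_QUANTUM * gamma_l * gamma_r / detuning ** 2


def coupling(gamma_at_contact, decay_per_angstrom, distance_angstrom):
    """Hypothetical model (an assumption): coupling decays exponentially with distance."""
    return gamma_at_contact * math.exp(-decay_per_angstrom * distance_angstrom)


if __name__ == "__main__":
    # Illustrative numbers only: the same molecular level, with the gap to
    # BOTH electrodes widened from 2 to 3 angstrom.
    for distance in (2.0, 3.0):
        gamma = coupling(0.1, 1.0, distance)
        print(distance, linear_response_conductance(gamma, gamma, 0.0, -1.0))
```

Under these assumed parameters, widening both gaps by 1 Å multiplies the conductance by e^{-2}, which is the kind of size-induced variation that probing from a single lateral side is meant to avoid.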

Last but not least, the above analysis of lateral contacts implies that double-stranded DNA
(dsDNA) could be sequenced directly by a careful design using our BGLC. By incorporating a
suitable DNA manipulation technology, only one strand could be kept sufficiently close to the BGLC
while the other strand would always remain further away. Since the tunneling current depends
exponentially on the distance of an individual base from the electrodes, the strand close to the
BGLC will by far dominate the electrical output signal and thus get sequenced. This would be an
overwhelming advantage compared to current sequencing proposals based on single strands, which are
severely restrained by the self-hybridization effect. 25 We further explore this prospect and
discuss the required protocol below.

Despite the advantages mentioned above, one crucial demand of our bilayer graphene contacts is
that the interlayer conductance of graphene, $g_0$, should be as small as possible, so that the
effective signal, which is the tunneling current caused by passing nucleotides, would not be
immersed in the fluctuation of a huge $g_0$. This requirement may be fulfilled by fabricating the
bilayer with a relative twist, with which the two layers are predicted to be electrically
isolated. Experimentally, adjacent layers that were misoriented by only a few degrees when grown
epitaxially on SiC were found to be very weakly electrically coupled. Thus the intrinsic
interlayer conductance between adjacent graphene sheets might not be a severe obstacle for our
proposal.

Figure 2(a) plots the molecular orbitals of the four types of nucleotides, with the Fermi energy
of graphene indicated by a dashed-dotted line in the graph. It can be seen that the HOMO rather
than the LUMO would dominate the conductance when using graphene electrodes. The above results
were obtained by using the extended Hückel model and the YAEHMOP package, and similar findings
were shown in ab initio calculations. Moreover, the locations of the HOMO of the different
nucleotides shown in this figure would determine the relative order of the maximum tunneling
conductance of the four bases.

For an estimation of the tunneling conductance through the translocating polynucleotide, we
adopt the single energy level model for individual nucleotides 23 shown in Eq. (1), where
$\Gamma_{L(R)}$ has been replaced by $\Gamma_{T(B)}$, which is the coupling between the nucleotide
and the top (bottom) layer of the graphene probe, and $E_0$ is now the energy of the HOMO of a
given nucleotide. Then, the overall conductance of the whole strand is the sum of the
contributions from each nucleotide:

$$G = \sum_{i=1}^{N} G_0^i \, e^{-\kappa \left| \vec{r}_i - \vec{r}_0^{\,i} \right|} \qquad (2)$$

In the above expression, $\kappa = 1.1$ Å$^{-1}$ is the decay constant of graphene, $G_0^i$ is the
maximum tunneling conductance of the $i$th nucleotide calculated by Eq. (1), $\vec{r}_0^{\,i}$ is
the corresponding center-of-mass position of that nucleobase when optimally coupled to the
electrodes, thus giving rise to the maximum tunneling conductance $G_0^i$, while $\vec{r}_i$ is the
actual center-of-mass location of the $i$th nucleobase during the translocation. In this way, the
conductance of each base on the target strand is estimated according to the tunneling distance
between the base and the graphene contacts. 15
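
The strand sum described above can be sketched in a few lines of Python. The sketch assumes that each nucleotide contributes its maximum conductance damped by exp(-κ|r_i - r_0^i|), with κ = 1.1 per ångström as quoted in the text; if the intended expression carries an additional numerical factor in the exponent, only the numbers change, not the structure. The function name, the normalized conductances and the 7 Å neighbour offset are hypothetical choices for illustration.

```python
import math

KAPPA = 1.1  # decay constant in 1/angstrom, as quoted in the text


def strand_conductance(g_max, positions, optimal_positions, kappa=KAPPA):
    """Sum the per-nucleotide contributions g0 * exp(-kappa * |r - r0|).

    g_max             : maximum conductance of each nucleotide (any common unit)
    positions         : current centre-of-mass positions, (x, y, z) in angstrom
    optimal_positions : positions of optimal coupling, (x, y, z) in angstrom
    """
    total = 0.0
    for g0, r, r0 in zip(g_max, positions, optimal_positions):
        total += g0 * math.exp(-kappa * math.dist(r, r0))
    return total


if __name__ == "__main__":
    # Three nucleotides with normalized (hypothetical) maximum conductance 1.0.
    # The middle one sits at its optimal position; its neighbours are offset
    # by an assumed 7 angstrom along the pore axis.
    g_max = [1.0, 1.0, 1.0]
    optimal = [(0.0, 0.0, 0.0)] * 3
    current = [(0.0, 0.0, -7.0), (0.0, 0.0, 0.0), (0.0, 0.0, 7.0)]
    print(strand_conductance(g_max, current, optimal))
```

With these assumed numbers the two neighbours together add well under one percent to the signal of the nucleotide at the contacts, which is the sense in which the exponential damping supports single-base resolution.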

Let us first discuss the tunneling conductance of single nucleotides dwelling in the bilayer
graphene nanopore. The maximum conductances $G_0$ of ADE, GUA, THY and CYT are indicated by dashed
lines in Figure 2(b); owing to the distribution of the HOMO energies of the four nucleotides, they
are well separated and clearly distinguishable. However, the measured conductance in any real
experiment would naturally exhibit significantly broadened distribution curves, since there is
stochastic variation in the molecular positions ($\Delta r$) during the electrical measurement. We
estimate $\sqrt{\langle \Delta r^2 \rangle} \approx 0.5$ Å in the confining nanopore based on
molecular dynamics (MD) simulations 31 (an animation is included in the supporting material), and
present the normalized distributions of tunneling conductance with fluctuation effects in
Figure 2(b). Although there are overlaps, the distributions for each base remain discernible. From
the experimental point of view, the overlaps indicate that a single measurement would not be
sufficient for distinguishing the four nucleobases; rather, several independent measurements and
subsequent discrimination based on statistical analysis are needed. According to the formalism
developed by Lagerqvist et al., about 52 measurements would yield 99% accuracy (a detailed
derivation is provided in the supporting material).
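
How positional fluctuations broaden the conductance of a single dwelling nucleotide can be illustrated with a small Monte Carlo sketch. It assumes, as a simplification not made in the text, that the displacement is Gaussian along a single axis with standard deviation 0.5 Å, and that the conductance is damped by exp(-κ|Δr|) with κ = 1.1 per ångström. It does not reproduce the measurement-count estimate, whose derivation is given in the supporting material.

```python
import math
import random


def sample_log10_conductance(sigma=0.5, kappa=1.1, n=10000, seed=0):
    """Draw n values of log10(G / G0) for Gaussian displacements along one axis.

    sigma : standard deviation of the displacement in angstrom
    kappa : decay constant in 1/angstrom
    """
    rng = random.Random(seed)
    samples = []
    for _ in range(n):
        displacement = rng.gauss(0.0, sigma)
        # log10(exp(-kappa * |dr|)) = -kappa * |dr| / ln(10)
        samples.append(-kappa * abs(displacement) / math.log(10.0))
    return samples


if __name__ == "__main__":
    values = sorted(sample_log10_conductance())
    mean = sum(values) / len(values)
    fifth_percentile = values[int(0.05 * len(values))]
    print("mean log10(G/G0):", mean)
    print("5th percentile  :", fifth_percentile)
```

Because every sample lies at or below the maximum conductance, the distribution is one-sided on a logarithmic axis; whether two bases can be told apart then depends on how the spread compares with the separation of their $G_0$ values.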

For whole-strand sequencing, we propose that over-stretching of the DNA molecule within the
nanopore is essential to our BGLC system. The physical reason is that DNA polymers are highly
flexible in solution: in the absence of any regulation of the molecular conformation, adjacent
nucleotides on the target strand would experience very large variations in their coupling to the
lateral graphene probes when passing them. The resulting huge fluctuations in the tunneling
conductance would defeat any nucleobase identification effort, since the conductance is extremely
sensitive to changes of the tunneling barrier. On the other hand, the straightened backbone of
over-stretched DNA could lead to more uniform and improved coupling between each passing
nucleotide and the bilayer probes. This is demonstrated in Fig. S1 of the supporting material.

We first discuss a highly idealized case of single-stranded DNA (ssDNA) dynamics and the
associated electrical sequencing, where an over-stretched DNA strand slides through the nanopore
at constant speed in the absence of any fluctuation effect. This sets the foundation of our
proposed approach, in the sense of testing whether, in the best imaginable scenario, the
conductance measured by the BGLC can be utilized for detecting individual nucleotides on the
threaded strand. Figure 3(a) plots the calculated time-dependent tunneling conductance G(t) of
segments on a DNA strand with the sequence CGATCGATGT. The inset illustrates the highly idealized
configuration of overstretched ssDNA in the bilayer graphene nanopore. The results show that
different nucleotides do indeed have distinguishable electronic signals in this idealized case. As
explained above, in this simplified picture the relative order of conductance of the four
nucleotides is dictated by their HOMO locations with respect to the graphene Fermi energy.
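
The idealized constant-speed scenario can be mimicked with a short Python sketch that slides a rigid strand past the contacts and records G(t). The per-base maximum conductances, the nucleotide spacing and the speed below are placeholder values chosen only to make the example run; they are not the values underlying Figure 3(a). The damping exp(-κ|z|) with κ = 1.1 per ångström is likewise an assumed form.

```python
import math


def ideal_trace(sequence, g_max_by_base, spacing, speed, times, kappa=1.1):
    """Return G(t) for a rigid strand moving along the pore axis at constant speed.

    sequence      : string of base letters, e.g. "CGATCGATGT"
    g_max_by_base : dict mapping each base letter to its maximum conductance
    spacing       : distance between neighbouring nucleotides in angstrom
    speed         : translocation speed in angstrom per unit time
    times         : iterable of time points
    """
    trace = []
    for t in times:
        total = 0.0
        for index, base in enumerate(sequence):
            # Axial offset of nucleotide `index` from the graphene contacts.
            offset = index * spacing - speed * t
            total += g_max_by_base[base] * math.exp(-kappa * abs(offset))
        trace.append(total)
    return trace


if __name__ == "__main__":
    # Placeholder relative conductances, NOT values from this work.
    placeholder_g_max = {"A": 1.0, "C": 0.5, "G": 2.0, "T": 0.25}
    times = [0.5 * step for step in range(140)]
    trace = ideal_trace("CGATCGATGT", placeholder_g_max, spacing=7.0, speed=1.0, times=times)
    for t, g in zip(times[::14], trace[::14]):
        print(t, g)
```

Sampling at multiples of spacing/speed returns values close to the maximum conductance of the base then at the contacts, while the minima in between correspond to the gaps between neighbouring bases.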

In the following, we investigate the impact that fluctuations have on the measured conductance,
to see whether the above electronic signatures of different nucleotides can withstand such
fluctuations. In principle, there are two major causes of fluctuation: 9 (1) structural
deformation of the highly flexible DNA polymer and (2) collisions with water molecules and ions,
which are inevitable in the aqueous environment. Mathematically, the fluctuation can be accounted
for by randomizing $\vec{r}_i$ in Eq. (2) with a standard deviation
$\sqrt{\langle \Delta r_i^2 \rangle}$. Here the magnitude of $\sqrt{\langle \Delta r_i^2 \rangle}$
characterizes the strength of the fluctuation, and in this work it was estimated by MD simulation
(see Method and the movies in the supporting material). G(t) of overstretched ssDNA passing
through the nanopore in the presence of fluctuation was then evaluated and is plotted in
Figure 3(b). This figure illustrates that not only the signatures of different bases, but also the
electrical characteristics of the gaps between bases remain discernible. The former, termed the
"base state", and the latter, termed the "gap state", are marked in Figure 3(b). Identification of
the gap state is a crucial requirement for whole-strand sequencing. 7 From a physics point of
view, it requires that the change in conductance between two neighboring nucleotides is
significant compared to experimental noise. Furthermore, the time scale of the gap state should be
sufficiently large to be measurable. Figure 3(b) shows that this could be achieved by
incorporating over-stretching of the polymer into our BGLC system. Without the over-stretch, the
electrical characteristics of the gap between bases would become blurred, making it impossible to
separate the data of each base (see Fig. S2 and the accompanying discussion in the supporting
material).

Since the over-stretch of polynucleotides within the nanopore plays a fundamental role in our
proposed sequencing strategy, we further explore its potential experimental implementation in our
design. We suggest an electric-field approach, in which the field required for DNA over-stretch
within the pore is about 60 pN/e ≈ 0.38 V/nm. 32 Without an additional dragging force, under such
a strong longitudinal electric field the DNA translocation would be too fast for any practical
measurement inside the pore. 7 We propose that the nanopillars shown in Figure 1(a) could be
utilized to exert the required dragging force on the polymer to balance the electrical driving
force: by tuning the configuration and surface properties of the nanopillar array, the magnitude
of the dragging force on the passing polymers could be manipulated within a large range. 33 The
main advantage of the above proposal is that the electric-field approach lends itself to
scaling down for device integration, compared to DNA stretching using optical or magnetic
tweezers.
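
The conversion between a force per elementary charge and an electric field quoted above can be checked with a few lines of Python. The function name is our own; the only physical input is the exact SI value of the elementary charge.

```python
ELEMENTARY_CHARGE = 1.602176634e-19  # coulomb (exact SI value)


def force_per_charge_to_field(force_pn_per_e):
    """Convert a force per elementary charge (pN/e) into an electric field (V/nm)."""
    field_v_per_m = force_pn_per_e * 1e-12 / ELEMENTARY_CHARGE  # N/C equals V/m
    return field_v_per_m * 1e-9  # V/m -> V/nm


if __name__ == "__main__":
    print(force_per_charge_to_field(60.0))
```

For 60 pN/e this gives roughly 0.37 V/nm, in line with the value of about 0.38 V/nm quoted in the text to within rounding.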

Another potential application of our proposed device architecture is the possibility to sequence
even dsDNA. If implemented, dsDNA sequencing would exhibit an overwhelming advantage over ssDNA
sequencing, in which the maximum strand length is severely limited by self-hybridization. A
careful inspection indicates that the obstacle to dsDNA sequencing with previous in-plane
transverse electrodes is that each base pair has to make contact with both the left and the right
probe to give rise to the transverse tunneling current. This is illustrated in Figure 4(b).
Consequently, the electrical signal is the sum of the contributions from the two complementary
nucleotides of that pair, which allows identification of merely the whole base pair (i.e., AT vs
GC), but not of which base belongs to which strand. On the other hand, it may be feasible that in
our system the nucleotides on only one strand are selected by the lateral bilayer contacts during
the translocation, as sketched in Figure 4(a). By incorporating a molecule manipulation technology
such as tethering to tweezers, one strand is maneuvered sufficiently close to the inner surface of
the nanopore and hence gets detected by the BGLC for electrical interrogation, while the other
strand makes much more random contact and thus a trivial contribution to the overall conductance
(a movie based on MD simulation is provided in the supporting material). This is quantitatively
demonstrated in Figure 4(c) and (d), where the calculated time-dependent tunneling conductances
G(t) of dsDNA in the BGLC and 2-D transverse electrode systems are plotted, respectively. The
resolution may be further enhanced, since our MD simulation reveals that upon strong manipulation
the positional fluctuation $\langle \Delta r^2 \rangle$ of the translocating strands is remarkably
attenuated. 34 We emphasize that our computational results can only provide a tentative insight
into the prospect of sequencing dsDNA with the BGLC, although it seems very alluring.
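
Why the strand held close to the contacts dominates the signal can be made concrete with a small sketch. It assumes that both strands carry a base of equal maximum conductance, that the near base sits at its optimal position, and that the far base is damped by exp(-κ·Δd) with κ = 1.1 per ångström; the extra distances used are hypothetical.

```python
import math


def far_strand_fraction(extra_distance, kappa=1.1):
    """Fraction of the total conductance contributed by the more distant strand.

    extra_distance : additional tunneling distance of the far strand in angstrom
    kappa          : decay constant in 1/angstrom
    Assumes equal maximum conductance for the bases on both strands.
    """
    far = math.exp(-kappa * extra_distance)
    return far / (1.0 + far)


if __name__ == "__main__":
    for extra in (0.0, 2.0, 5.0, 10.0):
        print(extra, far_strand_fraction(extra))
```

At zero extra distance both strands contribute equally, which corresponds to the in-plane geometry where only the base pair as a whole is identified; under the assumed damping, a few ångström of additional distance already pushes the far strand's share to the percent level or below.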

Conclusion 

In summary, we have proposed the use of adjacent layers of bilayer graphene embedded as two
separate electrodes for detecting the tunneling current when driving a DNA polymer through a
nanopore. Our theoretical study has shown that single-base resolution on the target DNA strand
could be achieved readily. If the corresponding experiment is implemented successfully, nanopore
sequencing with long-strand DNA could be performed, while the complexity of the fabrication
process is expected to be much more modest compared to other suggested device architectures.

Method 

We performed MD simulations of a single nucleotide dwelling in the bilayer graphene nanopore, and
of single-stranded and double-stranded DNA translocating through the nanopore, with NAMD2, and
then extracted the time-variant atomic configuration of the target nucleotides. Details of the MD
simulation: the pore was made of two layers of graphene with 2.4-nm thick silicon nitride on the
top and bottom as insulating layers; the pore diameter was about 4 nm; DNA molecules were
constructed by NAMOT; the nanopore system was then solvated in TIP3 water with periodic boundary
conditions in an NVT ensemble and with a 1 M solution of potassium and chloride ions; the
CHARMM27 force field was used for DNA, while UFF parameters were used for graphene carbon atoms
and other atoms; a stretching force of about 300 pN was exerted on the DNA strand, and the
electrical driving field was $E_z \approx 1$ kcal/mol-e. The MD animations and the associated
time-variant nucleobase positions $\vec{r}(t)$ are presented in Movies 1, 2 and 3, respectively.
Based on this real-time information on the molecular geometry, several important physical
parameters were extracted: (1) the molecular orbitals of the nucleotide being scanned were
calculated using YAEHMOP at each snapshot during the translocation; (2) the average HOMO positions
$E_{\mathrm{HOMO}}$ of the four bases were then obtained; (3) $\langle \Delta r^2 \rangle$ of the
nucleotides during the translocation was calculated.

Figure 1: (a) A schematic of our proposed nanopore sequencing system, where an array of
nanopillars is fabricated for manipulating DNA before it enters the nanopore. (b) Magnified view
of the embedded bilayer graphene electrodes for DNA sequencing, with the top and bottom layers as
two separate electrical contacts. Before the DNA enters the nanopore, the nanopillar array serves
to stretch the polymer so that the molecule threads through in a one-fold manner; after capture
into the pore, the blockage of the transmembrane ionic current triggers a switch of the nanopillar
surface properties, and consequently a much stronger dragging force is exerted on the DNA, which
is expected to induce overstretch; then, the tunneling conductance measured by the bilayer
graphene electrodes serves as the electrical signature of the nucleotides on the overstretched
DNA. (c) and (d) Electrical interrogation of guanine and cytosine bases using the in-plane
graphene electrode structure. Here backbone atoms have been omitted to give a clearer comparison
between the two measurements. The coupling of the molecule with the graphene contacts is marked by
the pink lightning symbols. The directions of the tunneling currents are indicated schematically
by yellow arrows. A nanopore fitting the size of a guanine would be $\Delta d$ too large for a
cytosine, as indicated in (d). (e) Electrical interrogation of adenine using bilayer graphene
contacts, where the coupling and electrical current direction are marked in a similar way.

Figure 2: (a) Molecular orbital energies of the four types of nucleotides in the range of
interest. The Fermi level of graphene is indicated by a dashed-dotted line. (b) Normalized
distribution of tunneling conductance through individual nucleotides from the bilayer graphene
electrodes. Here the full width at half maximum of the conductance distribution is determined by
the standard deviation of the nucleotide location in the aqueous environment. The dashed lines
denote the maximum conductance $G_0$ of the four nucleotides when they dwell at the optimized
position between the top and bottom graphene layer contacts.

Figure 3: (a) Tunneling conductance versus time, G(t), of a single-stranded DNA translocating
through the bilayer graphene nanopore in a highly idealized manner, i.e., the over-stretched
polynucleotide is moved with constant speed. The segment shown here contains the base sequence
CGATCGATGT. The inset shows schematically the idealized DNA configuration and the lateral bilayer
graphene electrodes. (b) G(t) of the same DNA strand, taking into account the fluctuations caused
by statistical variations in the nucleotide locations during the measurement process. The dashed
red circle marks a base state, while the blue circle marks a gap state.



[Figure 4, panels (c) and (d): horizontal axes show time in arbitrary units.]

Figure 4: Sequencing double-stranded DNA with (a) bilayer graphene lateral contacts and (b)
in-plane graphene electrodes. Coupling between the polymer and the contacts is marked by pink
lightning symbols. Directions of the tunneling currents are indicated schematically by yellow
arrows. (c) and (d): Time-dependent tunneling conductance G(t) of (a) and (b), respectively, where
black dashed curves correspond to the ideal case, while red curves take fluctuations into account.
While the four bases in one strand of the double-stranded DNA are identifiable in the bilayer
graphene setup (c), the in-plane graphene electrode can, as expected, only distinguish between the
AT base pair via its lower conductance level and the GC base pair via its higher conductance
level (d).